# Speed up (filtered) KNN queries for flat vector fields #130251

Merged: 21 commits merged into elastic:main on Jun 30, 2025

## Conversation

@jimczi (Contributor) commented Jun 27, 2025


For dense vector fields using the `flat` index, we already know a brute-force search will be used—so there’s no need to go through the codec’s approximate KNN logic. This change skips that step and builds the brute-force query directly, making things faster and simpler.

I tested this on a setup with **10 million random vectors**, each with **1596 dimensions** and **17,500 partitions**, using the `random_vector` track.
The results:

### Performance Comparison

| Metric            | Before    | After      | Change    |
| ----------------- | --------- | ---------- | --------- |
| **Throughput**    | 221 ops/s | 2762 ops/s | 🟢 +1149% |
| **Latency (p50)** | 29.2 ms   | 1.6 ms     | 🔻 -94.4% |
| **Latency (p99)** | 81.6 ms   | 3.5 ms     | 🔻 -95.7% |

Filtered KNN queries on flat vectors are now over 10x faster on my laptop!
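For illustration, here's a minimal sketch of what the per-segment brute-force scan boils down to: iterate the field's vectors, skip anything the filter rejects, and keep the top `k` by similarity. It uses the Lucene 9-style `FloatVectorValues` iteration API, and the class and method names are assumptions for the example, not the actual code in this change.

```java
// Minimal sketch (not the actual Elasticsearch code): exact top-k scan over one
// segment's float vectors, bypassing any approximate-KNN structures.
import java.io.IOException;
import java.util.Arrays;
import java.util.PriorityQueue;

import org.apache.lucene.index.FloatVectorValues;
import org.apache.lucene.index.LeafReader;
import org.apache.lucene.index.VectorSimilarityFunction;
import org.apache.lucene.search.DocIdSetIterator;
import org.apache.lucene.search.ScoreDoc;
import org.apache.lucene.util.Bits;

final class ExactKnnScan {

    /** Scores every filter-accepted vector in the segment and keeps the best k. */
    static ScoreDoc[] topK(LeafReader reader, String field, float[] query, int k,
                           VectorSimilarityFunction similarity, Bits acceptDocs) throws IOException {
        FloatVectorValues vectors = reader.getFloatVectorValues(field);
        if (vectors == null || k <= 0) {
            return new ScoreDoc[0];
        }
        // Min-heap on score so the weakest hit is evicted first.
        PriorityQueue<ScoreDoc> heap = new PriorityQueue<>(k, (a, b) -> Float.compare(a.score, b.score));
        for (int doc = vectors.nextDoc(); doc != DocIdSetIterator.NO_MORE_DOCS; doc = vectors.nextDoc()) {
            if (acceptDocs != null && acceptDocs.get(doc) == false) {
                continue; // rejected by the filter: never scored
            }
            float score = similarity.compare(query, vectors.vectorValue());
            if (heap.size() < k) {
                heap.add(new ScoreDoc(doc, score));
            } else if (score > heap.peek().score) {
                heap.poll();
                heap.add(new ScoreDoc(doc, score));
            }
        }
        ScoreDoc[] hits = heap.toArray(new ScoreDoc[0]);
        Arrays.sort(hits, (a, b) -> Float.compare(b.score, a.score)); // best first
        return hits;
    }
}
```

Since every accepted vector is scored exactly, the filter is just a cheap per-document check rather than a constraint on a graph traversal.
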
@elasticsearchmachine (Collaborator)

Pinging @elastic/es-search-relevance (Team:Search Relevance)

@elasticsearchmachine added the `Team:Search Relevance` label on Jun 27, 2025.

@elasticsearchmachine (Collaborator)

Hi @jimczi, I've created a changelog YAML for you.

@benwtrent (Member) left a comment

I am loving these numbers. Thank you for digging into this!

@jimczi (Contributor, Author) commented Jun 30, 2025

I tweaked the brute-force nested version so that it always diversifies the child docs. It’s similar in spirit to what we do for HNSW, but much faster here since we can just walk through each parent’s block in order.
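Sketched out, the diversification is a single ordered pass over the children that flushes the best-scoring child whenever the walk crosses into a new parent block. The sketch below is illustrative only (Lucene 9-style vector iteration, names assumed, edge cases omitted), not the actual implementation.

```java
// Illustrative sketch only: keep the single best-scoring child per parent block,
// walking the children in doc-id order (children precede their parent in the block).
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

import org.apache.lucene.index.FloatVectorValues;
import org.apache.lucene.index.VectorSimilarityFunction;
import org.apache.lucene.search.DocIdSetIterator;
import org.apache.lucene.search.ScoreDoc;
import org.apache.lucene.util.BitSet;

final class ParentBlockDiversify {

    /** Emits at most one ScoreDoc per parent: its best-scoring child. */
    static List<ScoreDoc> bestChildPerParent(FloatVectorValues children, float[] query,
                                             VectorSimilarityFunction similarity,
                                             BitSet parentDocs) throws IOException {
        List<ScoreDoc> results = new ArrayList<>();
        int currentParent = -1;
        int bestChild = -1;
        float bestScore = Float.NEGATIVE_INFINITY;
        for (int child = children.nextDoc(); child != DocIdSetIterator.NO_MORE_DOCS; child = children.nextDoc()) {
            // The parent is the first parent bit at or after the child (assumes well-formed blocks).
            int parent = parentDocs.nextSetBit(child);
            if (parent != currentParent) {
                if (bestChild != -1) {
                    results.add(new ScoreDoc(bestChild, bestScore)); // flush the previous block's winner
                }
                currentParent = parent;
                bestChild = -1;
                bestScore = Float.NEGATIVE_INFINITY;
            }
            float score = similarity.compare(query, children.vectorValue());
            if (score > bestScore) {
                bestScore = score;
                bestChild = child;
            }
        }
        if (bestChild != -1) {
            results.add(new ScoreDoc(bestChild, bestScore)); // flush the final block
        }
        return results;
    }
}
```

Because each parent's children are contiguous and visited in doc-id order, the whole pass is linear, with no per-parent work beyond locating the owning parent bit.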

I ran a comparison between main and this branch using the nested mode in the `random_vector` track (added here):

| Metric | Task | Baseline | Contender | Diff | Unit | Diff % |
| --- | --- | --- | --- | --- | --- | --- |
| Min Throughput | brute-force-filtered-search | 2052.39 | 85.0901 | -1967.3 | ops/s | -95.85% |
| Mean Throughput | brute-force-filtered-search | 2214.2 | 87.3678 | -2126.83 | ops/s | -96.05% |
| Median Throughput | brute-force-filtered-search | 2240.51 | 87.3315 | -2153.18 | ops/s | -96.10% |
| Max Throughput | brute-force-filtered-search | 2250.5 | 88.3819 | -2162.12 | ops/s | -96.07% |
| 50th percentile latency | brute-force-filtered-search | 2.2724 | 79.5413 | 77.2689 | ms | +3400.33% |
| 90th percentile latency | brute-force-filtered-search | 2.49625 | 136.674 | 134.178 | ms | +5375.18% |
| 99th percentile latency | brute-force-filtered-search | 8.14283 | 199.337 | 191.195 | ms | +2348.01% |
| 99.9th percentile latency | brute-force-filtered-search | 50.8515 | 265.505 | 214.654 | ms | +422.12% |
| 99.99th percentile latency | brute-force-filtered-search | 70.5917 | 359.416 | 288.824 | ms | +409.15% |
| 100th percentile latency | brute-force-filtered-search | 74.4209 | 438.119 | 363.698 | ms | +488.70% |
| 50th percentile service time | brute-force-filtered-search | 2.2724 | 79.5413 | 77.2689 | ms | +3400.33% |
| 90th percentile service time | brute-force-filtered-search | 2.49625 | 136.674 | 134.178 | ms | +5375.18% |
| 99th percentile service time | brute-force-filtered-search | 8.14283 | 199.337 | 191.195 | ms | +2348.01% |
| 99.9th percentile service time | brute-force-filtered-search | 50.8515 | 265.505 | 214.654 | ms | +422.12% |
| 99.99th percentile service time | brute-force-filtered-search | 70.5917 | 359.416 | 288.824 | ms | +409.15% |
| 100th percentile service time | brute-force-filtered-search | 74.4209 | 438.119 | 363.698 | ms | +488.70% |
| error rate | brute-force-filtered-search | 0 | 0 | 0 | % | 0.00% |

As you can see, performance drops off pretty hard in the nested filtered case, so there is a lot of room for improvement. I think we should tackle that in a follow-up. Profiling shows things like `TimeOutCheckingBits` showing up a lot, so there are some easy wins we can go after.

elasticsearchmachine pushed a commit that referenced this pull request Jun 30, 2025
…130263)

PR #130251 made me realize we were missing some important coverage.

This adds nested vector query (and top level knn) tests for flat indices in our yaml tests.

@benwtrent (Member) left a comment

OK, if this passes given the new yaml tests, I think this is g2g!

@jimczi (Contributor, Author) commented Jun 30, 2025

> OK, if this passes given the new yaml tests, I think this is g2g!

Well, it's broken ;)
Let me check what's going on.

@jimczi (Contributor, Author) commented Jun 30, 2025

OK, that was an issue when the diversifying query hits `NO_MORE_DOCS`. Let's wait for another round.

@jimczi merged commit 2142915 into elastic:main on Jun 30, 2025
32 checks passed
@jimczi deleted the brute_force_knn_optim branch on June 30, 2025 at 18:19
mridula-s109 pushed a commit to mridula-s109/elasticsearch that referenced this pull request Jul 3, 2025
mridula-s109 pushed a commit to mridula-s109/elasticsearch that referenced this pull request Jul 3, 2025
Labels: `>enhancement`, `:Search Relevance/Vectors`, `Team:Search Relevance`, `v9.2.0`